Murukarthick et al. BMC Research Notes 2014, 7:379 
http://www.biomedcentral.eom/1756-0500/7/379 



DATA NOTE Open Access 



BrassicaTED - a public database for utilization 
of miniature transposable elements in 
Brassica species 

Jayakodi Murukarthick 11 , Perumal Sampath 11 , Sang Choon Lee 1 , Beom-Soon Choi 2 , Natesan Senthil 3 , Shengyi Liu 4 
and Tae-Jin Yang 1 * 



(bmc 

Research Notes 



Abstract 

Background: MITE, TRIM and SINEs are miniature form transposable elements (mTEs) that are ubiquitous and 
dispersed throughout entire plant genomes. Tens of thousands of members cause insertion polymorphism at both 
the inter- and intra- species level. Therefore, mTEs are valuable targets and resources for development of markers 
that can be utilized for breeding, genetic diversity and genome evolution studies. Taking advantage of the 
completely sequenced genomes of Brassica rapa and B. oleracea, characterization of mTEs and building a curated 
database are prerequisite to extending their utilization for genomics and applied fields in Brassica crops. 

Findings: We have developed BrassicaTED as a unique web portal containing detailed characterization information 
for mTEs of Brassica species. At present, BrassicaTED has datasets for 41 mTE families, including 5894 and 6026 
members from 20 MITE families, 1393 and 1639 members from 5 TRIM families, 1270 and 2364 members from 16 
SINE families in B. rapa and B. oleracea, respectively. BrassicaTED offers different sections to browse structural and 
positional characteristics for every mTE family. In addition, we have added data on 289 MITE insertion polymorphisms 
from a survey of seven Brassica relatives. Genes with internal mTE insertions are shown with detailed gene annotation 
and microarray-based comparative gene expression data in comparison with their paralogs in the triplicated B. rapa 
genome. This database also includes a novel tool, K BLAST (Karyotype BLAST), for clear visualization of the locations for 
each member in the B. rapa and B. oleracea pseudo-genome sequences. 

Conclusions: BrassicaTED is a newly developed database of information regarding the characteristics and potential 
utility of mTEs including MITE, TRIM and SINEs in B. rapa and B. oleracea. The database will promote the development 
of desirable mTE-based markers, which can be utilized for genomics and breeding in Brassica species. BrassicaTED will 
be a valuable repository for scientists and breeders, promoting efficient research on Brassica species. BrassicaTED can 
be accessed at http://im-crop.snu.ac.kr/BrassicaTED/index.php. 

Keywords: Brassica, Miniature inverted-repeat transposable element (MITE), Terminal-repeat retrotransposon in 
miniature (TRIM), Miniature form transposable elements (mTEs), Short interspersed elements (SINEs), TE Database 
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Findings 

Background 

Transposable elements (TEs) are ubiquitous and com- 
pose a large fraction of most of eukaryotic genomes. For 
example, more than 80% of the maize genome [1] and 
40% of the B. rapa genome are occupied by transposons 
[2]. TEs can be mobilized and accumulated in the 
genome through cut and paste mechanism or a replica- 
tive transposition mechanism. Transposition of TEs can 
cause gene knock-out or rearrangement, altered regula- 
tion of expression, acquisition of novel function, and/or 
gene structure modification [3-5]. Although plant ge- 
nomes have large numbers of TEs, most of them remain 
as non-autonomous TEs, which cannot transpose with- 
out the assistance of a "partner" element providing 
transposition enzymes, because of accumulation of mu- 
tations, incomplete transposition processes or genetic 
and epigenetic control [4,6]. Miniature form transpos- 
able elements (mTEs) are especially small (<800 bp) TEs 
that are present in high copy numbers. They lack func- 
tional coding genes and rely on £raws-activation by long 
and complete counterpart autonomous TEs [3,7-9]. The 
mTEs include the terminal repeat retrotransposon in 
miniature (TRIM), short interspersed element (SINE) 
and miniature inverted-repeat transposable element 
(MITE) [4,8-10] families. 

TRIMs are derivatives of retrotransposons from which 
the coding domain for transposition has been lost. TRIMs 
have terminal direct repeats (TDRs) (100-250 bp) flanked 
by 5-bp target site duplications (TSDs) and contain an in- 
ternal sequence that begins with the signature region of 
the primer binding site (PBS) of tRNA-methionine and 
ends with a polypurine tract (PPT) motif [8,11,12]. Studies 
have suggested that amplification and mobilization of 
TRIM elements are controlled by autonomous partner 
elements like LTR retrotransposons. TRIMs have been 
identified in various plant species and are present at mod- 
erately high levels, with 2000 copies, in the Brassica gen- 
ome [8,11,13]. TRIM element-based markers have been 
effective for high quality mapping in Brassica [11]. 

Short interspersed elements (SINEs) are non-autonomous, 
non-LTR retrotransposons that are transcribed by RNA 
polymerase III and widely distributed in eukaryotic organ- 
isms [10,14]. SINEs have unique structural features, with 
so-called head, body and tail sequences, and have been 
found in higher copy numbers in animals than in 
plants [15-17]. For instance, the Alu SINE family is 
present in > 1,500,000 copies in the human genome and 
covers > 11% of the total genome [14,18,19]. Propagation, 
maintenance and movement of SINEs are controlled by 
long-interspersed nucleotide elements (LINEs). To date, 
210 SINE families from various plant and animal genomes 
including 16 SINE families related to Brassicaceae have 
been reported in SINEBase [20]. 



MITEs are small, non-autonomous class II DNA trans- 
posons that are AT-rich and have DNA structures charac- 
teristic of TEs, like terminal inverted repeats (TIRs) and 
TSD of variable lengths. Based on their TSD, most MITEs 
have been classified into several superfamilies. Stowaway 
superfamily members recognize 2-bp (TA) nucleotides 
and duplicate these at their terminal region via insertion. 
Similarly, Tourist superfamily members produce unique 
TAA duplications, hAT superfamily members produce 5-, 
6-, or 8 -bp TSD, Mutator superfamily members produce 
9- or 10-bp TSD, and En/Spm superfamily members pro- 
duce 3-bp non-unique TSD [4]. Available evidence sug- 
gests that MITEs originate through either internal 
deletion or cross mobilization from autonomous transpos- 
able elements [3]. MITEs can amplify into high copy num- 
bers, although the MITEs are thought be mobilized 
through cut-and-paste mechanisms. Analysis of complete 
information for MITEs on a genome-wide scale will be 
important for a better understanding of how MITE trans- 
position can produce high copy numbers, as well as of 
their influence on genome evolution [21]. 

mTEs are stably maintained and highly dispersed 
throughout the genome, often in association with genie 
regions, where they can alter gene structure and func- 
tion [22,23]. Insertion of MITEs has been reported to 
cause changes in pigmentation and flowering time in po- 
tato and rice, respectively [24,25]. However, most mTE 
insertions may not cause phenotypic changes but can 
alter gene expression and epigenetic patterns [8,26-28]. 
Their high copy number and close association with 
gene-rich regions make mTEs a potential source of mo- 
lecular markers, mTE-based marker systems have been 
successfully developed in various species for genetic 
mapping as well as analysis of genetic diversity, genome- 
wide association and evolution [8,29-34]. 

The genus Brassica (family Brassicaceae) is composed 
of more than 3700 species including important vegetable 
and oil-seed crops such as kimchi-cabbage, cabbage, and 
oilseed rape (B. rapa, B. oleracea and B. napus) [35]. B. 
rapa was the first Brassica vegetable to be sequenced 
and annotated, revealing recent genome triplication at 
13-17 MYA, after divergence from Arabidopsis thaliana 
20 MYA [2], While only 6% of the A. thaliana genome 
is occupied by TEs [36,37], around 38-40% of the B. 
rapa and B. oleracea genome is derived from TEs [2]. 
Genome analysis has suggested that TEs are important 
players not only in terms of genome size increases but 
also for gene and genome evolution, especially in the re- 
cently triplicated Brassica genome [2]. Recently, a plant 
MITE database was created for 41 sequenced plant ge- 
nomes, and 174 MITE families from B. rapa were identi- 
fied using in silico MITE identification tools [38]. 
However, experimental validation of identified mTE fam- 
ilies and genome-scale characterization of their members 
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with biological utility for Brassica species have not been 
developed or included in any databases. Furthermore, 
there is no resource with information regarding the dis- 
tribution of TRIM, SINE and MITE families in various 
genomic regions for the Brassica species. Therefore, we 
aimed to establish a unique web portal containing de- 
tailed characterization of TEs in Brassica species. As a 
starting point, we have investigated the MITE, TRIM 
and SINE families of B. rapa and B. oleracea at the 
whole-genome scale and developed a database, Brassi- 
caTED (Brassica Transposable Elements Database), for 
effective utilization of MITE, TRIM and SINEs in the 
Brassica genome. 

Construction and content 

We developed BrassicaTED as a database to provide 
comprehensive information about mTEs in the B. rapa 
and B. oleracea genomes. This database has three-tier 
architecture, namely a client tier, a middle tier and a 
database tier (Figure 1). The user- friendly interface, i.e. 
the client tier, was developed using PHP (v4.3.9) and 
JavaScript [39-41]. In the database tier, all information 
related to the mTEs is stored in a MySQL (v4.1.20) data- 
base. In the middle tier, an application forces all HTTP 
requests to be processed by an Apache web server 
(Figure 1). The database is currently hosted on a CentOS 
(5.8) Linux operation system. Additionally, we included 
the NCBI BLAST algorithm (version: 2.2.15) for 
sequence-based searches and sequence extractor tools. 

Browse 

In the Browse section, BrassicaTED provides a convenient 
panel in which to browse the structural characteristics, 
family classification, and sequence information with copy 
numbers in B. rapa and B. oleracea and A. thaliana for 
mTEs. It currently contains information for 20 MITE fam- 
ilies, 5 TRIM families and 16 SINE families characterized 
in the whole genomes of B. rapa and B. oleracea. We de- 
veloped this section to be fully updatable as more mTEs 
are identified and characterized (Figure 2 [1-1]). 

Search 

The Search section was developed to allow exploration 
of annotation and analysis for each mTE family in detail. 



In this section, there are five sub-categories: MITE 
Members, TRIM Members, SINE Members, IP Survey, 
and Gene & Expression (Figure 2). 

MITE members 

B. rapa pseudo-chromosome sequences with unanchored 
scaffolds (283 Mb) and gene annotation information 
(version 1.2) were obtained from the publically available 
database [42]. We included 20 MITE families that were 
identified from our studies and others in Brassica 
[8,30,43-45]. Consensus representative sequences of the 
20 MITE families were used to identify their members 
from 283 Mb B. rapa pseudo-chromosome sequences. 
The physical position of each MITE on the B. rapa 
pseudo-chromosome was described using a custom Perl 
script. A total of 5894 MITEs (Br-members) were recently 
identified in the B. rapa genome [45]. Among them, 98% 
could be classified based on the type of genomic sequence 
in which they were located, such as intergenic region, 5'- 
or 3'-UTR, exon or coding sequence (CDS), and intron, 
whereas the remaining 2% were located on unannotated 
scaffolds of the B. rapa genome. 

The recent availability of 385 Mb of B. oleracea pseudo- 
chromosome sequences (version 1.0) [46,47] allowed that 
analysis to expand to the B. oleracea genome. Similar to B. 
rapa, a total of 6026 members (Bo-Members) were anno- 
tated based on physical position and gene annotation in- 
formation for B. oleracea [42]. This work revealed that 
compared with Br-members, very few Bo-members (three) 
are present in the CDS of B. oleracea [45]. 

TRIM members 

In a previous study, we characterized four TRIM families 
(TRIM of Brassicaceae -1, -2, -3, -4) and their mem- 
bers in 96 Mb of B. rapa BAC end sequences and 
434 Mb of B. oleracea shotgun sequences, [8]. Now, we 
additionally characterized a high copy family, TB-5 
(Cassandra), in the pseudo-chromosome sequences of B. 
rapa and B. oleracea [13]. TRIM element insertion was 
analyzed based on physical position information and the 
annotation of the genomes of B. rapa and B. oleracea. In 
total, 1393 and 1639 copies from the five TRIM families 
were identified from B. rapa and B. oleracea whole- 
genome pseudo-chromosome sequences, respectively 
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Figure 1 Three-tier architecture of BrassicaTED. The client tier receives input from users and sends the request to an Apache server. The 
middle tier receives the request, processes it and passes it to the database tier through PHP and MySQL The user request is extracted from 
MySQL as a table and displayed on a browser in the html page. 
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Figure 2 Display of the Browse and Search panels of BrassicaTED. [1-1] The Browse page for the MITE family listing characterized MITEs. [1-0] 
Output display of the selected MITE family. [2-1] Query page to retrieve MITE family members on each chromosome. [2-0] Result display of MITE 
member page. [3-0] Display of all MITE families and MITE insertion polymorphism. [4-1] Query box for Paralogs & Expression tab. [4-0] Paralog 
partners and ortholog gene display for selected genes. 



(Table 1). Distribution analysis of these five TRIM fam- 
ilies shows insertion throughout the chromosomes of 
the B. rapa and B. oleracea genomes. Interestingly, 619 
(44%) and 656 (40%) members reside in or within 2 kb 
of a gene in the B. rapa and B. oleracea genome, re- 
spectively. This information will be highly valuable for 
marker development and characterization of target re- 
gions that have TRIM element insertion. 

SINE members 

We have included the characterization of the members 
of 16 SINE families from Brassicaceae, which we have 
named as SINE of Brassicaceae 1-16. The sequence in- 
formation for 16 families was obtained from SINEBase, 
and 1270 and 2364 members were retrieved from the B. 
rapa and B. oleracea genomes [20]. The insertion of 
SINEs in various genomic locations was characterized 
based on physical position and genome annotation infor- 
mation, showing that 599 (47.1%) and 1154 (48.8%) of 
the members from B. rapa and B. oleracea, respectively, 
were present in close association with genie regions 
(with < 2 kb of a gene) (Table 2). This will be a valuable 
resource from which researchers can develop markers 
for tightly linked genes or regions on the chromosome. 



Insertion polymorphism (IP) survey 

Insertion polymorphism inter- and intra- Brassica species 
due to insertion or absence of a mTE in specific accessions 
was surveyed by PCR using flanking primers [48]. For 
MITE insertion polymorphism (MIP) analysis, 187 and 
145 MITE targets were surveyed from B. rapa and B. oler- 
acea members, respectively [30,45]. Among them, primers 
for 162 Br-members and 127 Bo-members produced clear 
amplicons, of which over than 52% (150) showed MIP in 
the tested Brassica relatives and their MIP information 
were summarized such as Figure 3. BrassicaTED includes 
information for the primers used and polymorphism pro- 
files with the tested accessions. Data for additional inser- 
tion polymorphism surveys of more MITE members and 
for TRIM and SINE members will be added in the future. 



Gene and expression analysis 

BrassicaTED includes information regarding paralogous 
genes in B. rapa and their orthologs from A. thaliana, 
obtained from the B. rapa genome database [42], Single, 
duplicate, and triplicate paralogs and their orthologs in 
A. thaliana are listed according to the gene order in A. 
thaliana. The expression level of B. rapa genes with in- 
ternal MITE insertion was surveyed against the B. rapa 



Table 1 Characteristics of five TRIM families in the B. rapa and B. oleracea genomes 
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a MITE members categorized by their insertion >2 kb from the nearest gene. 
b MITE members categorized by their insertion 500-2000 bp from the nearest gene. 

C MITE members residing in the 500 bp upstream (5'UTR) or downstream (3'UTR) of the start or stop codon, respectively. 
d MITE members found in unanchored scaffolds of the B. rapa genome. 



Table 2 Characteristics of 16 SINE families in the B. rapa and B. oleracea genomes 



SINE No. of SINE members in B. rapa (283 Mb) genome No. of SINE members in B. oleracea (385 Mb) genome Copies in 
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a MITE members categorized by their insertion >2 kb from the nearest gene. 
b MITE members categorized by their insertion 500-2000 bp from the nearest gene. 

C MITE members resid in the 500 bp upstream (5'UTR) or downstream (3'UTR) of the start or stop codon, respectively. 
d MITE members found in unanchored scaffolds of the B. rapa genome. 
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Figure 3 MITE insertion polymorphism analysis shows 
intra- and inter- species polymorphism between B. rapa and B. 
oleracea. Representative analysis of intra (A)- and inter (B)- species 
polymorphism between B. ropo and B. oleroceo. The original gels are 
shown in A and B, whereas (A') and (B') show the corresponding 
images generated based on the gel scores. The accessions used for 
the MIP survey are the same as those in [30]. 



24 k microarray database, which was generated from four 
different stress treatments: cold (4°C), salt (250 mM 
NaCl), drought (air-dry) and ABA (100 uM) [49]. 



Tools 

This section includes tools for basic bioinformatics analysis 
to visualize or map the query sequence onto the chromo- 
some region and perform similarity searches. A sequence 
extractor tool is also provided to extract target sequences 
for further analysis, such as for MIP primer development 
or related gene structure identification (Figure 4). 



homologous chromosomal regions and visualize the result- 
ing distribution pattern. Users can visualize the locations of 
many different query sequences on each chromosome in 
different colors using K BLAST. It is a straightforward 
approach to inspect the distribution or dispersion of any 
DNA sequences (e.g. genes or repeats) on the chromo- 
somes of B. rapa and B. oleracea. We have also included 
A. thaliana genome sequence for use in K BLAST. 

BLAST 

The tool panel offers the standard NCBI BLAST program 
for sequence similarity searches against the genome se- 
quences. Users can match nucleotide or protein sequence 
(s) against database sequences such as whole-genome se- 
quences of B. rapa, B. oleracea, and A. thaliana. 

Sequence extract 

We incorporated a sequence extractor tool using Python 
script. This can be utilized by entering single or multiple 
start and end positions in column tab-delimited format in 
the input query box for specific regions on the selected 
chromosome of B. rapa and B. oleracea. This tool also 
provides an optional input box to add flanking region 
length for input start and end positions. 



K BLAST 



Utility and discussion 



We developed a new tool, which we call K BLAST (Karyotype TEs are important factors affecting genome size and 
BLAST), to map the unique query sequence (s) onto genome evolution. Among those, miniature form TEs 
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(mTEs) are valuable targets for genomics and phylogen- 
etic studies. Comprehensive analysis and tools are desir- 
able for effective utilization of mTEs for molecular 
studies, and were lacking. We developed BrassicaTED 
web as an interface for extensive characterization of 
MITEs, TRIM and SINEs in the Brassica genome. We 
will keep adding newly identified mTE elements to our 
database. 

BrassicaTED provides three sections (Browse, Search, 
and Tools) with user-friendly navigation. First, there are 
three parts of the Browse section: one each for MITEs, 
TRIMs and SINEs. Users can choose any MITE family 
in the list box and click submit' to enter the area per- 
taining to that family (Figure 2 [1-1]). This tab retrieves 
a summary of all MITE families included, such as MITE 
element classification, sequence information, structure, 
total members of a particular family with navigation to 
view the members and their closely related members, 
etc. (Figure 2 [l-O]). Similarly, TRIM and SINE family 
members can also be queried to obtain the family 
characterization by choosing any element from that 
family section. The Browse panel thus allows users to 
find summaries of complete information about families 
of MITEs, TRIMs and SINEs. 

In the Search panel, users can type the name of any 
family in the search box to find its members. Using the 
position drop-down option, users can filter the members 
returned by choosing the required genomic location 
(intergenic, CDS, intron, UTR) and clicking 'Search' for 
retrieval (Figure 2 [2-1]). This retrieves members of the 
selected family in each chromosome, their start and end 
positions on the chromosome, the length of the query 
alignment and e-value threshold, obtained from annota- 
tion (Figure 2 [2-0]). When 'intergenic' is chosen, the 
names of the two genes flanking the element are 
returned. When other options like CDS, intron, 5 'UTR 
and 3 'UTR are chosen, the member located in the cor- 
responding genie regions is given with the gene name. 

Brassica genomes contain triplicated homologous coun- 
terparts relative to the genome of A. thaliana [2,50]. Re- 
cent investigation of BraSto-2 suggested that MITEs may 
have important roles in triplicated Brassica genome evolu- 
tion [30]. BrassicaTED also provides navigation tools to 
determine paralogous partners and microarray-based gene 
expression profiles of individual mTE-inserted genes 
(Figure 2 [2-1]). Transposition of MITEs, TRIM and 
SINEs in various genomic locations can alter the gene 
structure and also gene expression patterns [8,30,44]. 
mTEs associated with genes may cause gene silencing, 
sub-functionalization or neo-functionalization. The infor- 
mation regarding mTE position and annotation of genes 
harboring mTE insertion will be helpful for association 
studies and candidate gene approaches by comparison of 
their functional diversity among a range of accessions. 



The Search panel also includes IP survey, which is a 
valuable source for genomics, genetic diversity studies, 
and evolutionary analysis [31,51,52]. Currently, there are 
about 18349 IP targets from 41 mTEs known in the 
Brassica genome and >50-85% of them are related to 
genie regions. The accumulated IP marker information 
from 289 MITE targets (Figure 2 [3-0]) will be effect- 
ively used for Brassica genomics studies. We surveyed 
289 MIP targets that provided good genetic diversity 
based on presence or absence of a MITE in the target 
region among different accessions. MIP survey profiles 
in seven different Brassica derivatives were scored as 1, 
2 and 3 for insertion as a full site, non-insertion as an 
empty site and both insertion and non-insertion as 
corresponding full and empty sites, respectively. The gel 
analysis has also been included as a link and users can 
view the gel profile with graphical images and accession 
information by clicking the link (Figure 3). This infor- 
mation will help researchers choose MIP primers for 
diversity studies using their own population. We will 
continue to update our database with more IP analysis 
of mTE targets. We also welcome other researchers to 
submit their data regarding mTEs of Brassica species. 

The next sub-section of the Search panel is 'Gene & 
Expression'. As a mesopolyploid species, the B. rapa 
genome has been estimated to contain 41,174 protein 
coding genes in triplicated form compared to the 
genome of A thaliana [2]. Our study on BraMi-1 MITE 
insertion suggests that MITEs may modify the structure 
of one copy among such triplicated genes and eventually 
its expression pattern [8,30]. The 'Gene & Expression' 
analysis provides information regarding paralogous 
genes and 24 k microarray expression profiles for any 
gene of B. rapa. Under this sub-section, users can find 
differential expression patterns of mTE-inserted genes 
via comparison with expression of their paralogs. Users 
can obtain the available paralogous partner and gene se- 
quences in FASTA format for any B. rapa gene by enter- 
ing the gene ID (Figure 2 [4-1]). Clicking the 'Expression' 
link navigates to another page where the user can select 
expression profiles derived from four stress treatments: 
cold, salt, drought and ABA (Figure 2 [4-0]). The expres- 
sion level of the selected genes will be shown with out- 
put as a graph or a table. This feature will be updated 
with more microarray datasets and can be used to 
analyze expression differences for any gene of interest. 

In the Tools section using K BLAST, users can visualize 
the distribution of query sequences with user-defined 
specific options. This program contains options similar to 
NCBI BLAST but it has been updated with several new 
features (Figure 4 [A]). Users can perform K BLAST 
searches against all of the chromosomes or any particular 
chromosome of B. rapa, B. oleracea, or A. thaliana by 
selecting the chromosome number in the 'Chromosome 
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Select' option. Furthermore, users can filter the output based 
on alignment coverage and sequence identity from each 
query by choosing a range between 60 and 100%. In K 
BLAST, up to 25 queries can be visualized on chromosome 
sequences simultaneously. An example shows the K BLAST 
output using three different mTE families (BraSto-3 TB-1, 
and SB- 10) (Figure 5). The NCBI BLAST algorithm retrieves 
the matched database sequences in a flat file format, and 
even though it displays a graphical representation, it cannot 
show the overall positions precisely, unlike K BLAST. In 
addition, K BLAST offers options for filtering the query 
coverage and the percentage of sequence identity, which can 
be highly useful for research. 

BrassicaTED also offers a sequence extractor tool. With 
this tool, the user can select the chromosome number in 
the drop-down menu, input the start and end position(s) in 
the input text box, and click 'Extract' to retrieve the 
sequences. This returns an output of extracted sequences 
in FASTA format with defined positions. We have provided 
an example of the input position on the program as 
reference for the users (Figure 4 [C]). This tool will be 
useful for users wanting to extract target region(s) with 
flanking sequence for designing primers or further analyses. 

In sum, we have characterized members of 41 mTE 
families using the whole-genome pseudo-chromosome 



sequences of B. rapa and B. oleracea, which allowed us 
to develop a database to promote effective utilization of 
TEs. Previously developed databases for B. rapa such as 
BRAD mostly handle distribution and mining of whole- 
genome data [42]. Similarly, the B. oleracea genome 
database (Bolbase) [46] was designed to provide genomic 
data, which mainly includes whole genome sequence, 
annotation and synteny comparison with B. rapa and A. 
thaliana. Though recently reported database for plant 
MITEs, P-MITE, has data for MITE families from 41 
plant species, it does not provide specific information 
regarding MITE insertion positions in the B. rapa and 
B. oleracea genomes. By contrast, BrassicaTED is built 
to establish an interactive web platform for all types of 
mTEs and includes in-depth analysis of 41 mTE families 
in B. rapa and B. oleracea, as well as in A. thaliana. In 
addition, BrassicaTED provides tools to extract gene 
sequences and compare the expression of paralogous 
genes using the 24 k microarray datasets. 

Conclusion 

BrassicaTED is a new database exclusively made with mTEs 
of Brassica species, especially from B. rapa and B. oleracea. 
This will be an important repository to promote the 
utilization of mTEs and the elucidation of the effects of 
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mTEs on genome evolution in Brassica species. Unlike 
other publically available Brassica databases, BrassicaTED 
has a unique user-friendly visualization tool (K-BLAST) 
and a microarray expression data comparison tool for B. 
rapa. BrassicaTED will be a valuable storehouse for scien- 
tists and breeders who work on Brassica species and will be 
continuously updated as more data is uncovered. 

Availability and requirements 

Datasets in BrassicaTED are freely accessible for re- 
search purposes for non-profit and academic organiza- 
tions at http://im-crop.snu.ac.kr/BrassicaTED/index.php. 
The database is optimized for Internet Explorer, Mozilla 
Firefox, Google Chrome and Safari. 
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